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I.  INTRODUCTION  AND  SUMMARY 


Let  X^.xJ15.  ...  ,  X^ ,  i  =  1,  2,  ...  ,c  be  samples  of  c  inde- 

1  1 

pendent  random  variables  with  continuous  cumulative  distribution 

(i)  * (i) 

functions  F  and  empirical  distributions  F 


F*(-1^(x)  =  0 


x  <  X 


Ci) 

1 


=  k/n.  xf1*  <  x  <  X,(lJ,  1  «  k  <  n. 

'  l  k  k+1’  i 


=  1 


X(i)  <  x. 
n. 

i 


We  assume,  without  loss  of  generality,  that  x|^  <  X^  ^  . . .  <  X^1^  . 

i 

To  test  the  hypothesis  that  the  population  distribution  functions 
are  identical,  Birnbaum  and  Hall1  introduced  the  statistics 


D(nlfn2,  ...  ,nc)  =  sup 

|F*<»Cx) 

-  F*fj)  fx)  |  , 

(1) 

x,i,j 

D+(n1,n2,  . . .  ,nc)  -  sup 

-  F*  CJ) Cx) ] 

(2) 

x,i<j 

i,j  =  1,2,  ...  ,c  which  are  generalizations  of  the  well  known  Kolmogorov- 
Smirnov  two  sample  statistics  D(m,  n)  and  D+(m,  n) . 


Under  the  null  hypothesis 


H  : 
o 


F(i)  =  F(j) 


i  J 


1,2, 


the  exact  small  sample  distributions  of  the  statistics  (1)  and  (2) 
were  determined  by  Birnbaum-Hall  using  difference  equations  that  could  be 
solved  recursively  and  tabled  values  of 


P[D(n,  n,  n)  <  r] ,  P[D(n,n)  *S  r] ,  P[D+(n,n)  <  r] 


were  developed  for  selected  values  of  n  between  1  and  40  and  of  r  =  k/n, 
k  1,2,  ...  ,n. 

1Z.  W.  Birnbaum  and  R.  A.  Hall,  "Small  Sample  Distributions  for  Multi-Sample 
Statistics  of  the  Smirnov  Type,"  Ann.  Math.  Stat.,  Vol.  31  (I960),  pp.  710-720. 
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PHECKDUIB  Plfl*  MJUK-MOT  F1U® 


The  limitations  on  the  tables  were  for  the  most  part  imposed  by  the 
speed  and  capacity  of  the  computer  necessary  to  solve  the  difference 
equations.  This  paper  utilizes  some  notions  from  graph  theory,  and  in  so 
doing,  provides  exact  small  sample  distributions  for  more  than  three 
samples  and  for  unequal  sample  sizes.  Representative  tables  are  included. 


II.  A  GEOMETRICAL  PERSPECTIVE 

For  a  geometrical  perspective  of  the  problem  we  parallel  in  the  next 
two  paragraphs  the  development  of  Birnbaum-Hall . 

*M\  *121  *  fcl 

The  c-dimensional  random  variable  [F  J  (x) ,  F  f  x ) ,  ...  ,  F  (x)], 

for  fixed  x,  takes  on  values  (k. /n, ,k_/n,,,  ...  ,k  /n  ),  k.  c  CO,  1,  ...  ,n.}: 

112  2  c  c  i  l 

i  =  1,  2,  ...  ,c,  which  lie  in  the  unit  hypercube.  The  transformation 
(x-.x-,  ...  ,x  )  into  (y.,y0,  ...  ,y  )  defined  by  y.  =  n.x.  maps  the  hypercube 
into  a  c-dimensional  hypervolume  with  sides  (n^.n,,  ...  ,nc)  and  the  random 
c-vector  into  points  (k^.k^,  ...  ,kc)  with  integer  valued  coordinates. 

Under  the  null  hypothesis  Hq:  i,j  =  1,2 . c  we  may  con¬ 

sider  the  c  samples  as  coming  from  the  same  population  with  N!  equally  likely 
ways  of  drawing  the  combined  sample  of  size  N  =  n^  +  n.,  +  ...  +  nc«  Now 
suppose  the  values  of  the  combined  ordered  sample  are  located  on  the  real  line, 
and  the  null  c-vector  is  used  as  a  counter  in  the  following  manner: 

Starting  at  a  value  less  than  min  {X^,x|^,  ...  ,x|c^l  with  the  counter 
equal  (0,  0,  ...  ,0)  and  moving  in  the  direction  of  increasing  magnitude, 

the  k th  coordinate  of  the  counter  is  incremented  by  one  whenever  a  sample 
(k) 

value  Xj  is  encountered.  In  so  doing,  a  1-1  correspondence  between  a  path 
in  c-space  from  (0,0,  ...  ,0)  to  (nj,n2,  ...  ,nc)  and  the  c  samples  is 
defined. 

The  total  number  of  such  equally  likely  paths  is  the  multinomial 
coefficient  N!/(nj!n2!  ...  r»c ! ) .  The  critical  region  for  the  test  will  con¬ 
sist  of  a  set  of  points  R  =  {(kj,  k^,  ...  ,kc)}  such  that  any  path  from 
(0,0,  ...  ,0)  to  (n^,n2,  ....  ,nc)  passing  through  R  will  cause  Uq  to  be 
rejected.  If  we  denote  the  number  of  paths  passing  through  R  as 
Q(nj,n2,  ...  ,nc;R)  then  the  probability  of  an  error  of  the  first  kind 
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will  be 


a  = 


Q(n1,n2, 


*vR) 


N'!/Cn1!n2! 


■^T 


III.  DETERMINATION  OF  EXACT  SMALL  SAMPLE  DISTRIBUTION 


Determination  of  the  exact  small  sample  distribution  of  (1)  and  (2) 
from  this  approach  involves  an  enumerative  process  to  determine 
QCnj,n2>  ...  ,nc;R).  The  set  R  defining  the  critical  region  suggests 
itself  as  R  =  {(k.,k7,  ...  ,k  )|  sup  |  n.k.  -  n.k.|  >  n.n.r}  and 

L  i  c.  J  1  1  J  1  j 


R+  =  {(krk2, 


’V 


sup  (n.k.  -  n.k.)  >  n.n.r]  corresponding  to 
i<j  J  1  1  ■*  1  J 


D(n^,n2,  ...  ,nc)  >  r  and  D+(n^,n2,  ...  ,nc)  >  r  respectively.  These  regions 
of  rejection  are  again  extensions  of  the  Kolmogorov-Smirnov  procedure; 
however,  there  is  no  theoretical  basis  for  their  limitation  to  this  form. 


Considering  the  paths  in  c-space  from  (0,0,  ...  ,0)  to  (n.,n?,  ...  ,n  ) 

2  l  -  c 

as  simple  paths  in  a  finite  directed  graph,  (see  e.g.  Berge  )  we  have  at 

3 

our  disposal  a  method  of  enumeration  appearing  in  a  paper  by  Pototchi 
which  seems  well  suited  to  computer  manipulation.  This  method,  while 
conceptually  analogous  to  the  difference  equation  evaluated  by  Birnbaum- 
Hall,  allows  us  to  accomplish  the  enumeration  necessary  to  evaluate 
Q(nj,n2>  ...  ,nc;R)  and  to  consider  the  number  of  samples  c  >  3  as  well  as 
unequal  sample  sizes  in  an  efficient  and  economical  manner. 


IV.  TABLES 

Table  1  describes  the  upper  tail  of  the  distribution  of  D,  and  contains 
the  probabilities  P[D(n^,n2>nj)  <  r]  for  5  <  ni>n2’n3  <  10  starting  at  the 
largest  increment  of  r  =  .1(. 1)1.0  for  which  the  tail  probability  equals  or 
exceeds  .20. 

2 

C.  Berge,  The  Theory  of  Graphs  and  its  Applications,  John  Wiley  f(  Sons, 

New  York,  1962. 

■*A.  Pototchi,  "A  Simple  Algorithm  for  Determining  the  Number  of  Paths  in  a 
Finite  Graph,"  Economic  Cyber  Studies  Res.,  Vol .  2  (1967),  pp.  81 -85. 
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Table  2  contains  the  probabilities  P[D(n,n,n,n)  <  r]  for  n  =  2(1)10 
corresponding  to  values  of  r  =  k/n  consistent  with  the  differences 
|F  ^ (x)  -  F  ^  (x)  |  which  can  occur. 

The  values  in  Table  1  for  the  triples  (n,n,n),  n  =  5(1)10  correspond 
to  those  computed  by  Birnbaum-Hall.  The  values  in  Table  2  for  the  4-tuples 

4 

(n,n,n,n),  n  =  5,10  are  consistent  with  those  Monte  Carloed  by  Gardner, 
et  al.  Taken  together,  these  provide  additional  verification  of  the 
accuracy  of  the  tables. 


4R.  H.  Gardner,  J.  E.  Pinder  III,  and  R.  S.  Wood,  "Monte  Carlo  Estimation  of 
Percentiles  for  the  Multi-Smirnov  Test,"  J.  Statist.  Comput.  Simul.,  Vol.  10 
(1980),  pp.  243-249. 


TABLE  1.  QUANTILES  OF  THE  BIRNBAUM-HALL,  OR  THREE  SAMPLE  SMIRNOV,  STATISTIC 


FOR  UNEQUAL  SAMPLE  SIZES. 


nl 

”2 

n3 

r 

P[D(n1,n2,n3)  <  r] 

s 

5 

5 

.6 

.81109 

.7 

.81109 

.8 

.97819 

.9 

.97819 

1.0 

1.00000 

5 

S 

6 

.6 

.67353 

.7 

.85609 

.8 

.94628 

.9 

.98457 

1.0 

1.00000 

5 

5 

7 

.6 

.75909 

.7 

.83058 

.8 

.96542 

.9 

.98776 

1.0 

1.00000 

5 

S 

8 

.6 

.74472 

.7 

.86485 

.8 

.97582 

.9 

.98947 

1.0 

1.00000 

5 

5 

9 

.5 

.46916 

.6 

.80259 

.7 

.88479 

.8 

.98176 

.9 

.99043 

1.0 

1.00000 

S 

5 

10 

.5 

.52097 

.6 

.83928 

.7 

.89685 

.8 

.98531 

.9 

.99100 

1.0 

1.00000 

nl 

n2 

n3 

r 

P[DCn1>n2,n3)  <  r] 

5 

6 

6 

.6 

.64253 

.7 

.89712 

.8 

.93216 

.9 

.98982 

1.0 

1.00000 

5 

6 

7 

.6 

.70972 

.7 

.87336 

.8 

.95008 

.9 

.99238 

1.0 

1.00000 

5 

6 

8 

.6 

.70123 

.7 

.88876 

.8 

.95968 

.9 

.99372 

1.0 

1.00000 

5 

6 

9 

.6 

.72578 

.7 

.91119 

.8 

.96508 

.9 

.99446 

1.0 

1.00000 

5 

6 

10 

.6 

.75753 

.7 

.92490 

.8 

.96825 

.9 

.99490 

1.0 

1.00000 

5 

7 

7 

.6 

.77668 

.7 

.84966 

.8 

.96547 

.  9 

.99462 

1.0 

1 .00000 

5 

7 

8 

.6 

.74794 

.7 

.88248 

.8 

.97350 

.9 

.99578 

1.0 

1.00000 
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TABLE  1.  (CONT'D) 


P[Dfn1 ,n2,n3)  <  r] 


nl 

n2 

n3 

r 

P[D(n1,n2,n3)  <  r] 

n2 

n3 

r  F 

1  [Dfn1 ,n2,n 

5 

7 

9 

.6 

.78143 

5 

10 

10 

.5 

.70953 

.7 

.90138 

.6 

.88851 

.8 

.97793 

.7 

.96359 

.9 

.99642 

.8 

.99235 

1.0 

1.00000 

.9 

.99870 

1.0 

1.00000 

5 

7 

10 

.5 

.54453 

6 

6 

6 

.6 

.68408 

.6 

.81483 

.7 

.93216 

.7 

.91269 

.8 

.93216 

.8 

.98047 

.9 

.99383 

.9 

.99680 

1.0 

i . ooooo 

1.0 

1.00000 

5 

8 

8 

.6 

.74256 

6 

6 

7 

.6 

. '4897 

.7 

.91251 

.7 

.91138 

.8 

.98065 

.8 

.94990 

.9 

.99685 

.9 

.99569 

1.0 

1.00000 

1.0 

1.00000 

5 

8 

9 

.6 

.78949 

6 

6 

8 

.6 

.74664 

.7 

.92949 

.7 

.90488 

.8 

.98457 

.8 

.95945 

.9 

.99745 

.9 

.99662 

1.0 

1.00000 

1.0 

1 . 00000 

5 

8 

10 

.5 

.62530 

6 

6 

9 

.6 

.74555 

.6 

.81808 

.7 

.92900 

.7 

.93951 

.8 

.96485 

.8 

.98681 

.9 

.99712 

.9 

.99780 

1.0 

1 . OOOOO 

1.0 

1.00000 

5 

9 

9 

.5 

.59437 

6 

6 

10 

.6 

.'8119 

.6 

.83481 

.7 

.94384 

.7 

.94532 

.8 

.96806 

.8 

.98821 

.9 

. 99739 

.9 

.99803 

1.0 

1 . OOOOO 

1.0 

1.00000 

5 

9 

10 

.5 

.65118 

6 

7 

7 

.5 

.49330 

.6 

.86208 

.6 

.81149 

.7 

.95464 

.7 

. 88975 

.8 

.99031 

.8 

.9651" 

.9 

.99836 

.9 

. 99719 

1.0 

1.00000 

1.0 

1 . OOOOO 
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TABLE  1.  (CONT'D) 


6  7  8 


6  7  9 


6  7  10 


6  8  8 


6  8  9 


6  8  10 


6  9  9 


* 

P[D(n1,n2,n3)  <  r] 

n_l 

n2 

n3 

r 

rillfiij  ,11.,, n 

.6 

.78637 

6 

9 

10 

.5 

.67974 

.7 

.90257 

.  6 

.  *4898 

.8 

.97319 

.  7 

.95453 

.9 

.99792 

.  S 

. 9901 1 

1.0 

1.00000 

.9 

.99934 

1.0 

1 . 00000 

.6 

.79154 

6 

10 

10 

.  5 

.70263 

.7 

.92361 

.6 

.87756 

.8 

.97765 

7 

. 96382 

.9 

.99830 

.  s 

.99217 

1.0 

1.00000 

.  9 

.99950 

1 .0 

1 . 00000 

.5 

.57700 

7 

7 

7 

.  5 

. 56209 

.6 

.82881 

.  0 

.86823 

.7 

.93638 

■7 

.86823 

.8 

.98023 

.8 

.97750 

.9 

.99851 

.9 

. 99830 

1.0 

1.00000 

1.0 

1 .00000 

.6 

.78567 

7 

7 

8 

.5 

.63985 

.7 

.91100 

.6 

.81671 

.8 

.98035 

.  / 

.89916 

.9 

.99855 

.8 

.98371 

1.0 

1.00000 

.9 

.99883 

1.0 

1 .00000 

.5 

.61580 

7 

7 

9 

.  j 

.61004 

.6 

.80505 

.  6 

.  S2936 

.7 

.92848 

.  7 

.91681 

.8 

.98430 

.8 

.98702 

.9 

.99888 

.  9 

.99909 

1.0 

1.00000 

1 .0 

1 . ooooo 

.5 

.65020 

7 

7 

10 

.  6 

.<>2594 

.6 

.83826 

.6 

.  8(i603 

.7 

.93879 

.7 

.92731 

.8 

.98659 

.8 

. 988SS 

.9 

.99906 

.9 

. 99922 

1.0 

1.00000 

1.0 

1 .00000 

.5 

.65335 

7 

8 

8 

.6 

.79392 

.6 

.81993 

.7 

.92685 

.7 

.94488 

.8 

.98888 

.8 

.98798 

.9 

. 99924 

.9 

.99918 

1.0 

1 . 00000 

1.0  1.00000 


r  1 
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TABLE  1.  (CONT'D) 


nl  n2  n3  r  P[D(n1,n2,n3)  <  r] 


7  8  9 


7  8  10 


7  9  9 


7  9  10 


7  10  10 


8  8  8 


8  8  9 


.5 

.64705 

.6 

.82084 

.7 

.94230 

.8 

.99158 

.9 

.99944 

1.0 

1.00000 

.5 

.68971 

.6 

.85147 

.7 

.95131 

.8 

. 99306 

.9 

.99955 

1.0 

1.00000 

.5 

.66124 

.6 

.84333 

.7 

.95640 

.8 

.99394 

.9 

.99962 

1.0 

1.00000 

.5 

.69235 

.6 

.87042 

.7 

.96455 

.8 

.99523 

.9 

.99970 

1.0 

1.00000 

.5 

.72037 

.6 

.89658 

.7 

.97223 

.8 

.99641 

.9 

.99979 

1.0 

1.00000 

.6 

.79392 

.7 

.95029 

.8 

.99292 

.9 

.99954 

1.0 

1.00000 

.5 

.67466 

.6 

.83749 

.7 

.96289 

.8 

.99494 

.9 

.99968 

1.0 

1.00000 

nl  n2  n3  r 


8  8  10  .5 

.6 
.7 
.8 
.9 
1.0 

8  9  9  .5 

.6 
.7 
.8 
.9 
1.0 

8  9  10  .5 

.  6 
.7 
.8 
.9 
1.0 

8  10  10  .5 

.6 

.7 

.8 

.9 

1.0 

9  9  9  .5 

.6 

.7 

.8 

.9 

1.0 

9  9  10  .5 

.6 
.7 
.8 
.9 
1.0 

9  10  10  .4 

.5 
.6 
.7 
.8 
.9 
1.0 


PfD(n1,n2,n3')  «  r] 


.  7473S 
.86362 
.96997 
.99601 
.99976 
1.00000 

.65584 

.87789 

.97370 

.99659 

.99980 

1.00000 

.71259 

.90160 

.97966 

.99745 

.99985 

1.00000 

.76924 

.92368 

.98495 

.99819 

.99990 

1.00000 

.71542 

.91350 

.98248 

.99785 

.99988 

1.00000 

.76978 

.93369 

.98714 

.99848 

.99992 

1.00000 

.53864 

.82180 

.95150 

.99104 

.99898 

.99995 

1.00000 
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TABLE  1.  (CONT’D) 


nl  n2 

n3 

r 

P[D(n1,n2,n 

10  10 

10 

.4 

.63715 

.5 

.86930 

.6 

.96645 

.7 

.99411 

.8 

.99936 

.9 

.99997 

1.0 

1.00000 
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TABLE  2.  QUANTILES  OF  A  FOUR  SAMPLE  SMIRNOV  STATISTIC  FOR  EQUAL  SAMPLE  SIZES. 


nl 

n2 

n3 

n4 

r  P [DO^ ,n2 ,nj,n4)  <  r] 

nl 

n2 

n3 

n4 

r  P[D(n 

1,n2,n3,n4)  <  r; 

2 

2 

2 

2 

.5  .22857 

8 

8 

8 

8 

.500 

.66901 

1.0  1.00000 

.625 

.91095 

.750 

.98651 

.875 

.99910 

1.000 

1.00000 

3 

3 

3 

3 

.333  .03740 

9 

9 

9 

9 

.444 

.56199 

.666  .64286 

.555 

.84982 

1.000  1.00000 

.666 

.96732 

.777 

.99574 

.888 

.99976 

1.000 

1.00000 

4 

4 

4 

4 

.25  .00526 

10 

10 

10 

10 

.5 

.78000 

.50  .35707 

.6 

.93879 

.75  .87219 

.7 

.98874 

1.00  1.00000 

.8 

.99875 

.9 

.99994 

1.0 

1.00000 

5555  .2  .00068 

.4  .18184 

.6  .69490 

.8  .95982 

1.0  1.00000 


6666  .500  .52300 

.666  .88076 

.833  .98824 

1.000  1.00000 


7777  .571  .77875 

.714  .95842 

.857  .99670 

1.000  1.00000 
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